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Abstract 

Background: Little research has explored who responds better to an automated vs. human advisor for health 
behaviors in general, and for physical activity (PA) promotion in particular. The purpose of this study was to explore 
baseline factors (i.e., demographics, motivation, interpersonal style, and external resources) that moderate 
intervention efficacy delivered by either a human or automated advisor. 

Methods: Data were from the CHAT Trial, a 12-month randomized controlled trial to increase PA among 
underactive older adults (full trial N = 218) via a human advisor or automated interactive voice response advisor. 
Trial results indicated significant increases in PA in both interventions by 12 months that were maintained at 
18-months. Regression was used to explore moderation of the two interventions. 

Results: Results indicated amotivation (i.e., lack of intent in PA) moderated 12-month PA (d = 0.55, p < 0.01) and 
private self-consciousness (i.e., tendency to attune to one's own inner thoughts and emotions) moderated 
1 8-month PA (d = 0.34, p < 0.05) but a variety of other factors (e.g., demographics) did not (p > 0.1 2). 

Conclusions: Results provide preliminary evidence for generating hypotheses about pathways for supporting later 
clinical decision-making with regard to the use of either human- vs. computer-delivered interventions for PA 
promotion. 

Keywords: Physical activity, Intervention, Moderation, Interactive voice response, Behavioral intervention technology 
system, Targeting 



Regular physical activity is a key factor in the prevention 
of a variety of chronic diseases (e.g., atherosclerotic heart 
disease, hypertension, diabetes, obesity), yet the majority 
of middle-aged and older adults remain insufficiently ac- 
tive [1]. Cost-effective and efficient methods for promoting 
physical activity that have greater public health impact are 
needed [2,3] . Exploring moderators of intervention success 
can improve the efficacy of health behavior interventions 
by identifying "...for whom or under what conditions the 
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treatment works" [p 878 [4]]. Identifying subgroups who 
respond differently to different types of interventions (e.g., 
based on intervention content or means of delivery) is a 
key priority in the physical activity promotion field as it 
fosters development of more focused and targeted inter- 
ventions [5-9] that, by extension, should ultimately lead to 
more cost-effective and efficient strategies for physical 
activity promotion [9]. It ought to be possible to target 
specific interventions based on a variety of personal, social, 
and environmental factors that can be assessed prior to 
starting a trial. 

A rapidly growing body of research is exploring the use 
of various technologies such as interactive voice response 
systems, text messaging, web pages, and smartphones for 



© 201 3 Hekler et al.; licensee BioMed Central ftd. This is an Open Access article distributed under the terms of the Creative 
Commons Attribution ticense (http://creativecommons.0rg/licenses/by/2.O), which permits unrestricted use, distribution, and 
reproduction in any medium, provided the original work is properly cited. 



Hekler et al. International Journal of Behavioral Nutrition and Physical Activity 2013, 10:109 
http://www.ijbnpa.org/content/! 0/1 /1 09 



Page 2 of 10 



promoting physical activity and other health behaviors 
[10-14]. Much of this previous research has largely fo- 
cused on determining if behavior change technologies can 
effectively promote physical activity. As there is increasing 
evidence that they can promote physical activity at com- 
parable levels to human counseling [10,15], an important 
follow-up is to identify which intervention works better 
for whom. Specifically, if two interventions are found to 
be equally effective, other factors that determine res- 
ponsivity or success become important for supporting 
clinical decision making. 

The Community Health Advice by Telephone (CHAT) 
trial tested the efficacy of 12-month human vs. automated 
computer advisors that delivered physical activity inter- 
ventions relative to a time-matched health education con- 
trol [10,15]. In the CHAT trial, the content, timing, and 
delivery channel (i.e., both via the telephone) of the inter- 
ventions were identical, with only the type of advisor (i.e., 
human vs. automated) differing between the two groups. 
Study results showed that both intervention strategies 
promoted significant increases in moderate-to-vigorous 
intensity physical activity (Mod+) relative to the health- 
education control over the course of 12 months, with 
mean minutes of Mod + for both intervention arms 
above the national guidelines of 150 minutes per week 
[16]. Additional results revealed that both arms maintained 
physical activity gains at 18 months with only minimal, 
self-initiated contact with their assigned human or auto- 
mated advisor [17]. 

Since results indicate that both the human and auto- 
mated advisors effectively promoted physical activity 
initiation and maintenance, a secondary aim of the 
CHAT trial was to evaluate potential moderators of the 
intervention-physical activity relationship to provide 
preliminary empirical evidence to support later clinical 
decision making for determining which intervention is 
"right" for whom [p 102, [15]]. Based on this, a number of 
psychosocial variables were measured at baseline within 
the CHAT trial to explore who responds best to a human 
vs. technology delivered intervention. Overall, the data 
from the CHAT trial is an excellent dataset for generating 
hypotheses about informing later clinical decisions about 
which intervention to recommend to whom as both inter- 
ventions share the same content, amount of contact, deli- 
very channel, both were optimized for their own delivery 
modality (e.g., while humans followed a script, they were 
still allowed to personalize the advice) and both were 
found to be similarly efficacious at promoting 12-month 
physical activity increases and 18-month maintenance of 
physical activity [17]. 

Few if any studies have explored who may respond 
better to a technology-based intervention compared to a 
human delivered intervention for physical activity initi- 
ation and maintenance. As discussed by King et al. [9], a 



variety of domains could conceivably be used to target 
an intervention such as demographic or psychosocial 
factors. In addition, there is a growing body of research 
emphasizing the importance of not just initiation of a 
health behavior but maintenance as well [18], and that 
baseline factors that are important in maintenance com- 
pared to initiation may be different [9,19]. As there is lit- 
tle to no evidence exploring key target variables for 
predicting differential intervention responses to techno- 
logy vs. human interventions for physical activity initi- 
ation or maintenance, we chose to focus on four broad 
domains (i.e., demographics, motivation, interpersonal 
style, and social resources) based on more general re- 
search on targeting [9,19] and a priori expectations [15]. 
We chose these four broad domains largely based on 
known differences in communication style that were 
present between our human vs. automated advisor inter- 
ventions. Specifically, while the automated advisor pro- 
vided the same intervention content (e.g., goal setting), 
it could not provide the full realm of interactions that 
are possible between humans. For example, even though 
the human advisor followed a structured script and 
protocol, the human advisors could still provide subtle 
shifts in tone or slightly rephrased advice that could be 
more in line with the current needs of a person com- 
pared to an automated advisor. Alternatively, there 
might be conditions in which the automated advisor's 
utilitarian focus on physical activity promotion without 
any other detours in conversation could actually be 
beneficial for certain individuals. 

With regard to demographics, we chose to specifically 
explore if age, gender, ethnicity, or marital status might 
impact differential responses. Among these demograph- 
ics, we believed gender would be the most important as 
we hypothesized that women may implicitly respond 
better to a human intervention whereas men may re- 
spond better to an automated advisor. We also believed 
that age might play a role as those who are older may 
have had a harder time working with the automated ad- 
visor and thus not respond as well. That said, we were 
also working in a restricted age range population (i.e., 
already older adult group) and thus had a restricted 
range to explore this moderation effect. We included the 
other demographics to foster hypothesis generation and 
because if they were significant moderators, they could 
easily be utilized for targeting in clinical research. 

With regard to motivation, self-determination theory 
[20] was the guiding theoretical model for the selected 
measures. We hypothesized that intrinsic motivation/au- 
tonomy may differentially impact responsiveness to a 
human vs. automated advisor (e.g., individuals with 
lower autonomy/intrinsic motivation at baseline would 
require a human advisor). We hypothesized this because 
we believed it possible that the human advisors, even 
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after closely following the protocol, might have been able 
to expand their repertoire of discussion points to coax 
less motivated individuals to action better than an auto- 
mated advisor. With regard to interpersonal style, we 
postulated that persons who experienced social anxiety 
would fare better with the automated advisor as socially 
anxious people feel discomfort in the presence of others 
and avoid those interactions, but an automated advisor 
may not trigger this experience. Finally, we focused on 
baseline social support because it may function as a 
"buffer" for reduced social contact or connectedness 
when counseling is delivered through an automated ad- 
visor. It is important to emphasize, however, that these 
hypotheses were based largely on our interpretation of 
behavioral theory and not previous empirical data. As 
such, we see this work as primarily hypothesis gene- 
ration rather than confirmatory hypothesis testing. 

Methods 

Study design 

The methods and the results, including descriptions 
reported to conform to the CONSORT guidelines for 
clinical trials, for the Community Health Advice by 
Telephone (CHAT) trial have been described in detail 
elsewhere [10,15-17]. In brief, the CHAT trial evaluated 
the 12-month effects, relative to a control arm, of two 
telephone-delivered physical activity guidance and sup- 
port interventions. Following completion of this trial 
phase, participants in the two intervention arms were 
followed for an additional 6-month period with only 
self-initiated contact with their respective advisors (i.e., 
either the human or automated advisor). Study results 
revealed significant improvements in Mod + for both 
study arms over 12 months [16] that were maintained at 
18 months [17]. Blinded evaluation occurred at baseline, 
6, and 12 months and an additional assessment was 
conducted at 18 months for the two intervention arms 
only. The Stanford Office of Human Subjects Research 
approved the study protocol. 

Interventions 

The counseling content delivered in both intervention 
arms was based on Social Cognitive Theory and the 
Transtheoretical Model [10,15]. Individuals randomized 
to the automated advice arm received the same interven- 
tion content, schedule of contact, number of contacts, 
and time per contact as the human advice arm. Full de- 
tails, including descriptive statistics on appropriate fide- 
lity checks to ensure similar content between the two 
groups have been reported previously [10,15,17] and will 
be described briefly here. Specifically, both the human 
advisor and automated advisor were scheduled to con- 
tact the participants 15 times during the intervention 
year. After the 12 month period, participants were given 



the option to call their advisors as they saw fit. As 
reported previously [17], significantly more individuals 
called in to the Human arm during the Maintenance 
phase (31.9% [n = 23] never called in) compared with the 
automated arm (58.1% [n = 43] never called in; f=3.4, 
p < .001). 

The following content was delivered by both the human 
and automated advisor: physical activity assessment, 
progress evaluation, individualized problem-solving, goal- 
setting, feedback, and delivery of positive support and 
tailored advice based on an individual's stage of change. 
Further, the human advisors were given scripts to fol- 
low that were parallel to the automated advisor scripts. 
The human advisor scripts provided specific content 
areas to cover and suggested language to use to cover 
each content area that was identical to the automated 
advisor. The scripts for the human advisors mimicked 
the logic used for determining the appropriate content 
to be delivered to the participant via the automated ad- 
visor. For example, the script for the human advisor in- 
cluded strategies for assessing a participant's current 
stage of change (e.g., contemplation, preparation, ac- 
tion). After this determination, specific stage-tailored 
scripts were used to provide the content for both the 
human and automated advisor. The users of the auto- 
mated advisor communicated using the touch-tone key- 
pad of their telephones [21,22], whereas the human 
advisors engaged in live verbal conversation. Since the 
human advisors used live verbal communication, some 
deviation from the scripts was allowed as already men- 
tioned. All human counselors were women with at least 
a bachelor's degree and were supervised by CMC. Simi- 
larly, the automated advisor was a pre-recorded women's 
voice of a professional "voice talent" who read aloud 
all of the scripts. The automated advisor "spoke" to par- 
ticipants over the telephone using computer-controlled 
"sound bites" of the prerecorded statements and 
questions. 

A variety of fidelity checks were accomplished through- 
out the study to further ensure the interventions were 
consistent [10,15]. Specifically, all human counseling ses- 
sions were recorded and regularly evaluated via an inde- 
pendent review of counselor summary forms at the time 
of each contact and independent review of audio-taped 
telephone contacts. Approximately one third of total 
telephone contacts, selected at random, were reviewed 
by other counseling personnel for independent coding 
and verification that the counselors were not straying 
from the script and content. Fidelity checks for the au- 
tomated advisor included semiweekly evaluation of the 
technical performance reports generated by the system 
(e.g., information about call rates, length of calls, etc) 
and also daily monitoring of the automated system's 
"help line," which was used to allow participants to 
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report any problems with the automated advisor. Descrip- 
tive statistics on the fidelity between the automated advisor 
and human advisor were reported previously [10]. Briefly, 
the two groups had similar number of completed calls (i.e., 

M-humanadvisor ~ 13.1 ± 2.5 VS. M-automatedadvisor ~ 11.8 ± 4.8 

calls completed), the human advisors did have a signifi- 
cantly longer call length compared to the automated ad- 
visor (i.e., M-humanadvisor ~ 10.7 i 5.0 VS. ^lautomatedadvisor ~ 

6.6 ± 2.2 min/call). Overall fidelity checks on the quality of 
the content suggested good fidelity to the intervention 
content and script between the human advisor and auto- 
mated advisor. 

Participants 

Eligibility criteria for the study were: ages 55 years and 
older; physically underactive (i.e., not engaged in more than 
60 min/week of Mod + over the previous 6 months), free 
of any medical condition that would limit participation in 
exercise, and able to speak and understand English suffi- 
ciently to provide informed consent and participate in study 
intervention and assessment procedures. Eligible individuals 
were randomized to a 12-month home-based moderate- 
intensity physical activity (primarily walking) adoption pro- 
gram delivered via a trained telephone counselor (Human 
Advice arm, n = 73), a similar program delivered via an 
automated, computer-controlled interactive telephone sys- 
tem (Automated Advice arm, n = 75), or a general health 
education control arm (n = 70). For these analyses, we 
only focused on the human and automated advice arms. 

Measurement of physical activity 
Stanford 7-day physical activity recall (PAR) 

The 7-Day PAR [23] was the primary measure of phys- 
ical activity. The PAR is a well validated structured inter- 
view that asks participants to estimate their total time 
during the past seven days engaging in four physical ac- 
tivity intensity categories: sleep, moderate, hard, and 
very hard physical activity [24]. The PAR has been 
shown to have good reliability [25], and concurrent va- 
lidity [26] at these intensities. The primary outcomes for 
these analyses were 12-month and 18-month PAR esti- 
mates of energy expenditure from Mod + (in kcal/kg/day). 

Measurement of proposed moderator variables 

Proposed moderators included demographics (i.e., age, 
gender, ethnicity, and marital status) and physical activity- 
specific measures of motivation based on self-determination 
theory (e.g., amotivation, autonomous motivation), interper- 
sonal style (i.e., social anxiety and self-consciousness), and 
social support. These variables were assessed at baseline 
prior to study arm random assignment. 

Self-determination theory constructs [20] were mea- 
sured using the Treatment Self-Regulation Question- 
naire for Exercise (TSRQ-E) [27]. This measure includes 



the following subscales: autonomous motivation (e.g., 
"I would exercise regularly because I feel that I want to 
take responsibility for my own health," current sample a = 
0.87), introjected regulation (e.g., "I would exercise regu- 
larly because I would feel guilty or ashamed of myself if 
I did not exercise regularly," current sample a = 0.83), ex- 
ternal regulation (e.g., "I would exercise regularly because 
I feel pressure from others to do so," current sample a = 
0.72), and amotivation (e.g., "I really don't think about it," 
current sample a = 0.49). Internal consistency of the items 
was very good across a large sample (N > 2000, as > 0.73) 
and the measures evidenced good validity [27] . 

Self consciousness and social anxiety were measured 
using the self-consciousness survey-revised (SCS-R [28]). 
The self-consciousness survey-revised is a validated meas- 
ure that includes subscales on private self-consciousness 
(e.g., "I am always trying to figure myself out," current 
sample a = 0.76), public self-consciousness (e.g., "I usu- 
ally worry about making a good impression," current 
sample a = 0.80), and social anxiety (e.g., "It takes me 
time to get over my shyness in new situations," current 
sample a = 0.75). Previous research indicates that it is a 
reliable and valid scale [28] . 

Social support was assessed using the social support 
for exercise (SSE) scale [29]. The SSE is a well-validated 
and widely used scale that measures social support for 
being physically active. It has been used across a wide 
variety of demographic groups including older adults 
[30]. It includes subscales on the supportiveness of both 
family (current sample, a = 0.92) and friends (current 
sample, a = 0.87). 

Statistical analyses 

Linear regression was used to conduct these analyses 
[31]. As the primary questions focused on differences 
between the two active interventions, the attention- 
control arm was not included in any analyses. All pri- 
mary moderator variables of interest (i.e., demographics, 
subscales from the TSRQ-E, SCS-R, SSE, and study arm 
assignment) were centered and then an interaction term 
was calculated between the moderator variable and 
study arm assignment [32]. The primary outcome vari- 
ables were 12-month and 18-month average daily PAR- 
based Mod + energy expenditure values. For 12-month 
outcome models, baseline PAR values were included in 
the models as a covariate. For 18-month analyses, a 
residualized change score, which represented the unique 
variance of change in physical activity from baseline to 
12 months [33], was entered into the model as a covari- 
ate to control for any changes that occurred during the 
intervention. We incorporated intent-to-treat principles 
for the primary outcomes of interest, whereby, for 
participants with missing or incomplete PAR data at 
12 months (n = 21) and 18 months (n = 33; 18 in Human 
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arm, 15 in Automated arm, between-arm difference p > 0.10), 
their baseline values were used. For some participants 
missing 12-month data (n = 8), secondary physical activity 
data sources were available (i.e., 12-month CHAMPS 
questionnaires, logged reports of physical activity occur- 
ring during the 7-12 month intervention period that were 
collected as part of all exercise advisor [automated or 
human] interactions). If the data from these secondary 
sources indicated that participants were at least as active 
as their 6-month PAR data indicated, then the 6-month 
values were used [see: 10, [34] for a full discussion on this 
imputation method]. The intent-to-treat analyses are 
reported for the PAR 12-month and 18-month outcomes. 
Alpha was set at .05 using a two-tailed test of significance 
for all interactions. 

Results 

Descriptive statistics 

Table 1 summarizes demographic characteristics, other 
baseline moderator variables, and 12-month and 18- 
month physical activity variables. Participants (n = 148) 
were mostly Caucasian, women, overweight, and had a 
mean age of 60.7 ± 5.9 years. There were no significant 



differences between the two arms at baseline on any demo- 
graphic variables. Participants in the human advisor arm, 
on average, reported greater private self-consciousness at 
baseline relative to the automated advisor arm (p < 0.05). 
There were no other significant differences between the 
human and automated advisor arms at baseline (see 
Table 1). 

Moderator analyses 

At 12 months, results indicated that baseline amotiva- 
tion moderated intervention effects {d = 0.55, ji = 0.27, 
p = 0.006). Examination of the plot (see Figure 1) suggests 
that participants reporting more baseline amotivation (i.e., 
lack of intent or interest in being physically active) to exer- 
cise responded better to a human, whereas those who 
reported less baseline amotivation (i.e., more initial phy- 
sical activity interest) responded better to automated ad- 
vice. No other variable significandy moderated 12-month 
physical activity levels (ps > 0.12). 

At 18 months, analyses revealed that private 
self-consciousness significantly moderated 18-month 
maintenance of physical activity (d = 0.34, /? = 0.15, p = 
0.04). Specifically, those who were high in private self- 



Table 1 Demographics, baseline moderators, and 12 m physical activity 



N 




Automated 




Human 








All 






75 






73 






148 




Demographics 














p 








White/Caucasian, n (%) 




67 (90.5%) 




59 (80.8%) 


0.09 


126 


(85.7 


%) 


Women, n (%) 




49 (65.3°/ 


6) 




50 (68.5% 


0 


0.68 


99 


(66.9 C 


Yo) 


Married/Co-habituating, n (%) 




52 (70.3°/ 


6) 




49 (67.1% 


.) 


0.63 


101 (68.7 


%) 


drop-out, n (%) 




6 (8.0%; 


1 




5 (6.9%) 




0.79 


11 


(7.4°/ 


i) 




M ± SD 


Min-Max 


M ± 


SD 


Min-Max 


P 


M ± SD 




Min-Max 


Years In School 


16.4=1 


: 1.8 


12.0-18.0 


16.2d 


: 1.9 


12.0-18.0 


0.63 


16.3 + 1.8 




12.0-18.0 


Age 


61.1 i 


= 5.6 


55.0-77.0 


60.4; 


z6.3 


52.0-78.0 


0.49 


60.7 ± 5.9 




52.0-78.0 


BMI 


29.2 d 


= 5.1 


17.9-39.3 


29.9 d 


= 5.2 


19.1-42.3 


0.40 


29.5 ± 5.2 




1 7.9-42.3 


Baseline Mod + (kcal/kg/day) 


0.7 ± 


1.1 


0.0-6.3 


0.8 ± 


1.0 


0.0-4.1 


0.67 


0.8 ± 1 .0 




0.0-6.3 


Baseline Moderators 






















Amotivation for exercise (1-7) 


2.0 ± 


1.0 


1 .0-7.0 


2.1 ± 


1.1 


1 .0-6.0 


0.67 


2.0 ± 1 .0 




1.0-7.0 


Autonomy for exercise (1-7) 


6.1 ± 


0.9 


3.7-7.0 


62 ± 


0.9 


3.2-7.0 


0.79 


6.2 ± 0.9 




3.2-7.0 


htrojected motivation for exercise (1-7) 


3.5 ± 


1.7 


1 .0-7.0 


3.6 ± 


1.5 


1 .0-7.0 


0.79 


3.6 ± 1 .6 




1 .0-7.0 


External motivation for exercise (1-7) 


2.3 ± 


1.2 


1 .0-6.3 


2.5 ± 


1.1 


1 .0-5.5 


0.21 


2.4 ± 1 .2 




1 .0-6.3 


Family social support for exercise (0-75) 


34.7 ± 


13.3 


0.0-75.0 


32.6 ± 


15.4 


0.0-73.0 


0.39 


33.7 + 14.4 




0.0-75.0 


Friend Social Support for Exercise (0-75) 


29.0 ± 


10.1 


0.0-57.7 


29.0 d 


= 9.0 


0.0-57.0 


1.00 


29.0 ± 9.5 




0.0-57.7 


Social anxiety (0-18) 


6.1 ± 


4.1 


0.0-17.0 


7.4 ± 


4.3 


0.0-1 7.0 


0.07 


6.7 ± 4.2 




0.0-17.0 


Private self-consciousness (0-26) 


12.9 =1 


= 4.7 


0.0-23.0 


14.5 □ 


= 5.5 


0.0-26.0 


0.07 


13.7 + 5.2 




0.0-26.0 


Public self-consciousness (0-20) 


10.3 d 


= 4.6 


0.0-20.0 


11.52 


= 5.5 


0.0-20.0 


0.10 


10.9 + 4.7 




0.0-20.0 


12 m PAR Mod + (kcal/kg/day) 


1.5 ± 


1.5 


0.0-7.6 


1.6 + 


1.2 


0.0-5.6 


0.93 


1 .6 ± 1 .4 




0.0-7.6 


18 m PAR Mod + (kcal/kg/day) 


1.4 ± 


1.6 


0.0-9.6 


1.5 ± 


14 


0.0-5.6 


0.14 


1.5 + 1.5 




0.0-9.6 



p values represent f-test or Chi-square analyses comparing automated vs. human study arms; Mod + moderate- to vigorous-intensity physical activity. 
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■ Low A mot 

HighAmot 



Automated computer advisor Human advisor 

Figure 1 Amotivation moderating 12 month physical activity. Note: Results above are predicted values based on a multiple regression 
model that included arm (t = 0.11, df= 147, p = 0.91), amotivation (f = -1.2, df= 147, p = 0.24), baseline moderate- to vigorous-intensity physical 
activity (Mod+) (f = -7.0, df= 147, p < 0.0001), and arm X amotivation (f = 3.31, df= 147, p = 0.001 2). Using standard conventions for modeling 
interaction effects, the y-axis represents the predicted value at 12 months of physical activity (i.e., kilocalories burned per kilogram per day that 
was in the moderate to vigorous intensity range) for those high and low in amotivation between the automated advisor and human advisor 
groups. Low Amot = One Standard deviation below mean for amotivation; High Amot = One standard deviation above the mean 
for amotivation. 



consciousness and randomized to the automated advisor 
did not maintain their physical activity levels as well as 
all other types of individuals (i.e., those low in private 
self-consciousness in the automated advisor arm and all 
individuals randomized to the human advisor arm, see 
Figure 2). Although not statistically significant, results 
also suggested a trend for baseline family social support 
moderating maintenance of physical activity at 18 months 
(d = 0.31, /3 = 0.13, p = 0.07). Specifically, those individuals 
who were high in family social support and who had been 
randomized to the human advisor maintained their phys- 
ical activity levels better than all others (see Figure 3). 

Discussion 

These secondary analyses indicated two moderators of 
physical activity based on intervention delivery method, 
with different moderator results at 12 and 18 months. 



At 12 months, individuals with high amotivation (i.e., 
less interested in physical activity at baseline) engaged 
in more physical activity at 12 months when assigned to 
a human advisor whereas those with low amotivation 
(more initial interest in becoming more active) responded 
better to an automated advisor. At 18 months, private 
self-consciousness moderated treatment response, with 
those high in private self-consciousness (i.e., individuals 
who engaged in more thinking and discussing matters of 
the self) exhibiting poorer maintenance outcomes when 
randomized to the automated advisor compared to the 
human advisor. To the best of the authors' knowledge, 
this is the first set of analyses suggesting that baseline 
psychosocial factors can moderate initial and longer- 
term efficacy of two interventions with similar content 
and contact time, but delivered with different commu- 
nication modalities. 




1L0 PSC 

Hi PSC 



Automated computer advisor Human Advisor 

Figure 2 Private self-consciousness moderating 18 month physical activity. Note: Results above are predicted values based on a multiple 
regression model that included Arm (t = 0.72, df= 147, p = 0.47), private self-consciousness (t = -1.02, df= 147, p = 0.31), residualized change score 
of physical activity from baseline to 12 months (f = 6.25, df= 147, p < 0.0001), and arm X private self-conscious (f = 2.04, df= 147, p < 0.05) 
predicting moderate- to vigorous-intensity physical activity (Mod+) at 18 months. Using standard conventions for modeling interaction effects, 
the y-axis represents the predicted value at 18 months of physical activity (i.e., kilocalories burned per kilogram per day that was in the moderate 
to vigorous intensity range) for those high and low in private self-consciousness between the automated advisor and human advisor groups. 
Low PSC = one Standard deviation below mean for private self-consciousness; High PSC = One standard deviation above the mean for private 
self-consciousness. This is based on previous conventions for graphing moderation analyses as described above. 
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1 Lo Fam SS 
Hi FamSS 



Automated computer advisor Human Advisor 

Figure 3 Trend result of family social support moderating 18 month physical activity. Note: Results above are predicted values based on a 
multiple regression model that included Arm (t = 0.65, df= 147, p = 0.52), family social support (f = 0.86, df= 147, p = 0.39), residualized change 
score of physical activity from baseline to 12 months (f = 6.54, df= 147, p < 0.0001), and arm X family social support (t= 1.88, df= 147, p = 0.06) 
predicting moderate- to vigorous-intensity physical activity (Mod+) at 18 months. Using standard conventions for modeling interaction effects, 
the y-axis represents the predicted value at 18 months of physical activity (i.e., kilocalories burned per kilogram per day that was in the moderate 
to vigorous intensity range) for those high and low in social support at baseline between the automated advisor and human advisor groups. Low 
SS = one Standard deviation below mean for family social support; High SS = One standard deviation above the mean for family social support. 
This is based on previous conventions for graphing moderation analyses as described above. 



Our current research suggests that if individuals have 
low amotivation (i.e., higher initial interest in being ac- 
tive) at baseline, an automated advisor may actually be 
preferable over a human advisor. In addition, the auto- 
mated advisor has the potential to be more cost- 
effective and less restrictive for wide-scale adoption, 
adding to its potential utility [13]. Our results further sug- 
gest, however, that a human advisor may be required if 
someone has higher baseline levels of physical activity 
amotivation (i.e., less initial interest in being active), 
given its association with 12-month behavioral initiation, 
or higher baseline private self-consciousness (given its 
association with 18-month behavioral maintenance). 

Future research is indicated to better explain these results 
using appropriately designed research studies with a priori 
hypotheses. One potential hypothesis to explain these re- 
sults could be that even though human and computer 
advisors utilized the same content, the human advisors 
may have provided subtle but important customized sup- 
port and feedback to amotivated participants that were 
not provided by the automated advisor which proved 
valuable for counteracting this initial lack of interest. 
This hypothesis could be explored further by conducting 
content analysis of human advisor-participant interactions 
(see: [35] for methods). Such information, while not avai- 
lable for all counseling interactions in the current study, 
could potentially help to inform further development of 
automated system interactions [36]. If these subtle diffe- 
rences can be found and linked with improved partici- 
pant response, they could conceivably be better modeled 
into automated systems such as via improvements to the 
overall scripts and logic for amotivated individuals or doing 
work akin to Timothy Bickmore's work [37], which focuses 
on better building in subtle interaction styles within 
technology-based health promotion systems. In addition, 



this hypothesis implicitly acknowledges that amotivation 
might decrease more over time with the human advisor 
compared to the computer. Post hoc exploration did in- 
deed find a trend for this, whereby amotivation decreased 
in the human advisor arm and remained steady in the auto- 
mated advisor arm, but this trend was not significant. 
As such, this point on the stability of amotivation over 
time requires additional empirical evaluation. 

Interestingly, different psychosocial factors were key 
moderators for maintenance of physical activity. Speci- 
fically, individuals high in private self-consciousness (i.e., 
individuals who are more inwardly focused) had less fa- 
vorable maintenance effects if they were matched with 
an automated advisor. Future research should explore 
possible explanations for these results using a priori hy- 
potheses. For example, one hypothesis for explaining 
such results is that individuals who are more immersed 
in issues of the self may require more attentive re- 
sponses from the advisor to aid in longer-term physical 
activity maintenance. These results are in line with other 
results from the CHAT trial reported elsewhere [17] that 
more individuals in the human arm voluntarily called in 
during the 18-month follow-up period than those in the 
automated advice arm. Such increased advisor contact 
during the maintenance phase could be a potentially 
useful tool for those high in private self-consciousness. 
This hypothesis suggests that private self-consciousness 
is a relatively stable trait-like characteristic, which our 
own data suggested is true. On average, the standard devi- 
ation across time per individual on private self- 
consciousness was 2.4; for context the possible range of the 
scale is from 1-26. This stability of private self- 
consciousness requires further empirical evaluation. It 
would be worthwhile to explore whether these results could 
be replicated in other studies focused on maintenance. 
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Similar to the amotivation results, another possibility is that 
our human advisors provided subtle shifts in the script that 
proved valuable for those high in private self-consciousness. 
Based on this, similar exploration of these recorded advi- 
sors might prove valuable at identifying these improve- 
ments to the script that could then be incorporated into a 
future automated advisor script/logic. 

The trend towards a significant interaction effect be- 
tween family social support for exercise and the human 
advisor intervention arm in predicting 18-month phys- 
ical activity suggests that having higher initial family 
support may help to enhance the support being received 
by a human advisor across the longer term. Contrary to 
expectations, social support did not act as a "buffer" for 
reduced social contact among those individuals random- 
ized to the automated advice. Given the speculative na- 
ture of these observations based on only trend results, 
replication, particularly with a more stringent empirical 
evaluation and possibly different measures, is required 
before any firm conclusions or concrete hypotheses can 
be drawn. 

An alternative but perhaps equally important implica- 
tion of this work is the fact that the majority of the mod- 
erators explored, including all demographics (i.e., age, 
gender, ethnicity, and marital status), did not moderate 
the impact of the two interventions. We are cautious 
about making any strong conclusions from this and call 
for additional studies to confirm our findings. However, 
if these future studies replicate our findings then both 
human and an automated advisors may promote in- 
creases in physical activity, regardless of many if not 
most possible psychosocial moderator variables. As such, 
the decision between using a human vs. an automated 
advisor could be based on other issues beyond efficacy 
such as resource constraints. 

Implications 

Overall, the moderator variables explored in this study 
are reasonably short self-report measures and thus 
would be relatively easy to administer prior to providing 
an intervention in real-world settings. This allows for 
the possibility of an evidence-based, low cost method for 
identifying who will respond better to an automated ad- 
visor or a human advisor for physical activity promotion 
prior to starting an intervention program. This informa- 
tion, assuming replication of results with other human 
and automated advice comparisons, has the potential to 
be valuable for providing empirical evidence to support 
later clinical decision making and by extension improv- 
ing the overall efficacy/effectiveness of interventions. Of 
particular value for cost-effectiveness is the future ex- 
ploration of combining different intervention delivery 
media (e.g., human, automated advisor, smartphone, or 
webpages) to capitalize on the inherent strengths of each 



media while minimizing their potential weaknesses for 
physical activity promotion. Further, these results may 
be interpreted differently in other research communities 
such as human-computer interaction or communica- 
tions research, which might place stronger emphasis on 
understanding and delineating the impact of differences 
in communication style between humans vs. human- 
computer interactions [37,38]. As such, further explo- 
ration of these questions from a variety of different dis- 
ciplinary perspectives is required to develop more robust 
transdisciplinary theories for understanding behavior 
and behavior change technologies [39,40]. 

Limitations 

This investigation has several limitations. The study fo- 
cused on secondary data analyses of a previously 
conducted trial. As such, the sample was not explicitly 
powered to detect interaction effects and the overall sta- 
bility and generalizability of the results are difficult to 
determine. Despite the potential for reduced power, 
moderator effects were obtained. However, other theor- 
etically plausible mechanisms such as social anxiety may 
still moderate treatment efficacy but will need to be ex- 
plored in a larger sample. This point is also relevant for 
the trend found for family social support, which needs 
to be replicated using a larger, more properly powered 
trial. As suggested by King et al. [9], this type of explora- 
tory analysis is a key first step for generating hypotheses 
and is conceptually distinct from hypothesis testing en- 
deavors. The results provide guidance for subsequent 
hypothesis-testing endeavors [9]. Finally, this sample in- 
cluded individuals who signed up to be part of an inter- 
vention trial. As such, the amotivation variable would 
likely be more restricted within this sample than within 
the more general population. This restricted range may 
have at least partially contributed to the relatively low 
reliability estimates within the scale. 

Conclusion 

Future research should replicate the study results utili- 
zing appropriate statistical power to confirm the impact 
of the moderators explored here, including the possibi- 
lity of null results. In addition, future research should 
explore experimental studies testing the hypotheses ge- 
nerated by the authors from these secondary analyses 
(along with counter hypotheses generated by others for 
explaining the results). Some potential hypotheses to ex- 
plore include: (a) for individuals who report reduced 
interest in being active initially, a human advisor pro- 
vides subtle but important customized support and feed- 
back not provided by the automated advisor, which 
counteracts this initial lack of motivation; (b) individuals 
who are interested in discussing matters of the self (i.e., 
high in private self-consciousness) require the option of 
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contacting a human during the maintenance phase of an 
intervention; or (c) plausible moderator variables do not 
in fact moderate the impact of an intervention delivered 
either by a human or a computer, therefore allowing for 
resources to drive clinical decisions with regard to their 
use. In addition, other research could include conducting 
studies in which participants are differentially assigned (i.e., 
systematically matched versus mismatched) to a targeted 
intervention delivery channel based on key moderators (i.e., 
amotivation, private self-consciousness). Future research 
also should compare other common and potentially cost- 
effective treatment models (e.g., print-based media) along 
with other technologies such as smartphones [14] for the 
development of strategies for providing appropriately 
targeted interventions to the individuals who will respond 
best to them. As suggested by our different results for initi- 
ation vs. maintenance, future research should explore 
which delivery channel should be used at which time dur- 
ing the trial. For example, individuals could start with an 
initial meeting with a human advisor, transition to an auto- 
mated advisor, and then, during maintenance switch to a 
smartphone application. Additional empirical data would 
be useful to explore these different possibilities and combi- 
nations, using empirical design strategies such as a sequen- 
tial multiple assignment randomized trial (SMART) [41]. 
Through applying this type of systematic process, advances 
toward answering the "whiches conundrum" [i.e., identify- 
ing which message, for which person, using which delivery- 
mechanism, within which context, at which time-point 
[42]] may occur more quickly, resulting in a broadened 
impact on the publics health. 
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